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Edited by R. McKenna, University of Middle East respiratory syndrome coronavirus (MERS-CoV) is a human 
Florida, USA pathogen responsible for a severe respiratory illness that emerged in 2012. 
Structural information about the proteins that constitute the viral particle is 
+ These authors contributed equally. scarce. In order to contribute to a better understanding of the nucleoprotein (N) 
in charge of RNA genome encapsidation, the structure of the C-terminal 
ep ORO INC CORiOte In MOGI Eat domain of N from MERS-CoV obtained using single-crystal X-ray diffraction is 


respiratory syndrome coronavirus; MERS-CoV; : : ; : ‘ 
ieee reported here at 1.97 A resolution. The molecule is present as a dimer in the 


crystal structure and this oligomerization state is confirmed in solution, as 
measured by additional methods including small-angle X-ray scattering 


Coronaviridae; X-ray diffraction; SAXS. 


PDB reference: C-terminal domain of 


MERS-CoV nucleocapsid, 6g13 measurements. Comparisons with the structures of the C-terminal domains of 
N from other coronaviruses reveals a high degree of structural conservation 
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the surface of the protein. 


1. Introduction 


Middle East respiratory syndrome coronavirus (MERS-CoV) 
is the etiological agent responsible for outbreaks of a severe 
respiratory disease in the Middle East, with sporadic occur- 
rence in Europe, Africa, Asia and America. MERS-CoV 
belongs to lineage C of the Betacoronavirus genus within the 
Coronaviridae family, one of the four viral families (together 
with Arteriviridae, Mesoniviridae and Roniviridae) that form 
the Nidovirales order. This group of viruses includes the 
largest-sized RNA genomes known, i.e. Roniviridae (~26 kb) 
and Coronaviridae (from 26.3 to 31.7 kb), as well as the 
medium-sized Mesoniviridae (20.2 kb) and the small-sized 
Arteriviridae (12.7-15.7 kb) (Adams et al., 2013). Despite 
having a positive-sense single-stranded RNA genome, all of 
these viruses protect their genomes of dissimilar size with a 
nucleoprotein (N). The 5’ part of the MERS-CoV genome 
codes for the polyproteins ppla and pplab, which are 
processed into 16 nonstructural proteins involved in the 
replication complex. The 3’ part of the genome codes for viral 
structural proteins and several accessory proteins, which are 
translated from a set of subgenomic mRNAs. Among these 
proteins, the nucleoprotein (N) is a multifunctional protein 
that plays roles in genome encapsidation (reviewed in Chang 
et al., 2014), RNA synthesis during replication, host trans- 
© 2019 International Union of Crystallography lational shutoff and innate immunity disruption (McBride et 
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al., 2014). The coronavirus N is a 40 kDa protein organized 
into two folded domains that are called the N-terminal domain 
(NTD) and the C-terminal domain (CTD). They are separated 
by a disordered region (called LKR) containing a serine/ 
arginine stretch which could regulate the functions of N upon 
phosphorylation (He et al., 2004; Surjit et al., 2005; Peng et al., 
2008; Wu et al., 2009). Despite many studies, the mechanism by 
which the RNA genome is encapsidated by N has not been 
fully unraveled. Indeed, the structure of full-length N is not 
known, probably owing to its flexibility, and the structures of 
the NID and the CTD have been explored separately. 
However, structural and functional data led to a consensus 
model in which the formation of the ribonucleocapsid (RNP) 
requires both multi-site interactions of the RNA along N and 
oligomerization of N (McBride et al., 2014). In this model, 
dimers of N generated through a CTD-CTD dimerization 
process represent the basic module of the RNP (Yu et al., 2006; 
Jayaram et al., 2006). A recent low-resolution electron- 
microscopy analysis of N from Murine hepatitis virus seems to 
converge towards a proposal in which N assembles as small 
oligomers (from dimers to tetramers) and forms a helical 
structure in complex with the RNA genome (Gui et al., 2017). 

The structure of the MERS-CoV NTD has been determined 
using X-ray crystallography (Papageorgiou et al., 2016). The 
overall structure is conserved and consists of a five-stranded 
antiparallel 6-sheet core domain with disordered loops of 
varying length connecting the strands. 

Sequence analysis of the C-terminal region of MERS-CoV 
N suggests that the CTD is 122 amino acids in length (amino 
acids 239-361 of the MERS-CoV N sequence) followed by an 
intrinsically disordered region of 50 amino acids. The struc- 
tures of CTDs from Human coronavirus NL63 (HCoV-NL63), 
Avian infectious bronchitis virus (IBV), Severe acute 
respiratory syndrome coronavirus (SARS-CoV) and Murine 
hepatitis virus (MHV) have been determined. The structures 
revealed a conserved structural organization in which the 
CTD forms a dimer with an intermolecular antiparallel four- 
stranded 6-sheet (Yu et al., 2006; Jayaram et al., 2006; Chen et 
al., 2007; Ma et al., 2010; Szelazek et al., 2017). This 6-sheet is 
surrounded by eight a-helices and 3, -helices. The CTD 
displays a positively charged area, highlighting that the CTD is 
a possible RNA-binding module. However, the surface of the 
charged area can vary from one viral isoform to another, 
suggesting variability in its RNA-binding affinity (Chen et al., 
2007). Several studies support the idea that RNA binding can 
occur along the N protein, with multiple binding sites acting 
cooperatively to form a ribonucleocapsid, as reviewed by 
Chang et al. (2014). However, no structure of full-length 
coronavirus N is available to support this hypothesis, most 
likely because the recombinant protein is not stable in solu- 
tion. 

In this study, we present the crystallographic structure of 
the MERS-CoV CTD at a resolution of 1.97 A. In addition, we 
report size-exclusion chromatography coupled to multi-angle 
light scattering (SEC-MALLS) and small-angle X-ray scat- 
tering (SAXS) studies of the CTD in order to gain insight into 
its structural organization in solution. 
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2. Materials and methods 
2.1. Protein production, purification and characterization 


The codon-optimized DNA encoding the CTD (amino acids 
239-362) of the MERS-CoV nucleoprotein (strain Betacorona- 
virus England 1, accession No. KC164505) was synthesized by 
GenScript. The coding sequence of the CTD was amplified by 
a two-step PCR before cloning into the expression vector 
pMCOX20A to enable the fusion of a cleavable thioredoxin/ 
6x His tag at the N-terminus (Lantez et al., 2011). The protein 
was then produced in Escherichia coli T7 Express (DE3) cells 
(New England Biolabs). Expression was induced overnight at 
290 K in Terrific Broth medium with 0.5 mM IPTG when the 
OD600 nm Of the culture reached 0.6. The purification of the 
protein and tag removal was performed under nondenaturing 
conditions, as described previously (Lantez et al., 2011). The 
final preparative size-exclusion chromatography (SEC) step 
was performed in 10 mM HEPES, 300 mM NaCl pH 7.5. In 
order to confirm the oligomerization state of the CTD, 
analytical SEC with online multi-angle laser light scattering, 
absorbance and refractive-index (MALLS/UV/RI) detectors 
was carried out on an Alliance 2695 HPLC system (Waters) 
using a Silica Gel KW802.5 column (Shodex) equilibrated in 
10 mM HEPES, 150mM NaCl pH 7.5. Measurement was 
performed using a triple-angle light-scattering detector 
(miniIDAWN TREOS, Wyatt Technology), a quasi-elastic 
light-scattering instrument (DynaPro, Wyatt Technology) and 
a differential refractometer (Optilab rEX, Wyatt Technology). 
Determination of the molecular weight from the MALLS data 
was performed using the ASTRA V software (Wyatt Tech- 


nology). 


2.2. Protein crystallization 


Crystallization trials were performed by the sitting-drop 
vapor-diffusion method at 293 K in a 96-well Swissci plate 
using a nanodrop-dispensing robot (Mosquito, TTP Labtech) 
by screening three different ratios of protein (300, 200 and 
100 nl) versus a constant volume of precipitant (100 nl). Wide 
intensive screening conditions were explored and crystals 
were found in condition No. 10 (0.2 M ammonium nitrate, 
2.2 M ammonium sulfate) from The AmSO, Suite (Qiagen). 
Optimal crystallization conditions were obtained by mixing 
200 nl protein solution at 11.5 mg ml“! with 100 nl reservoir 
solution. Crystals appeared within 48 h and were harvested 
after a week. The crystals of the MERS-CoV CTD were 
cryoprotected using 100% trimethylamine N-oxide (TMAO) 
prior to flash-cooling in liquid nitrogen at 110 K. 


2.3. Data collection and processing 


Diffraction data were collected on the ID23 beamline at the 
European Synchrotron Radiation Facility (ESRF), Grenoble, 
France using the large in-vacuum PILATUS 6M-F detector 
(Dectris, Switzerland) at a wavelength of 0.9787 A with 0.1° 
oscillation, 0.037 s exposure time and a temperature of 100 K. 
A total of 1400 images were measured and were subsequently 
processed using the autoPROC toolbox (Vonrhein et al., 
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2011). The data were integrated using XDS (Kabsch, 2010), 
and were then analysed with POINTLESS (Evans, 2006) and 
scaled with AIMLESS (Evans & Murshudov, 2013). The 
MERS-CoV CTD data were truncated at 1.97 A resolution to 
satisfy completeness and J/o(/) > 2 statistics. Phasing was 
performed by molecular replacement using Phaser (McCoy et 
al., 2007). A search model was generated using CHAINSAW 
(Stein, 2008) after first using Expresso (Di Tommaso et al., 
2011) to identify the structure with the closest sequence 
homology (46% identity) as that of the SARS CoV CTD, 
which was obtained by NMR (PDB entry 2jw8; Takeda et al., 
2008) and by X-ray diffraction (PDB entry 2cjr; Chen et al., 
2007). Refinement was performed by successive and alternate 
rounds of refinement with BUSTER (Bricogne et al., 2017) and 
model improvement using Coot (Emsley et al., 2010). The final 
model was evaluated using MolProbity (Williams et al., 2018). 

Surface electrostatics were calculated using APBS (Baker et 
al., 2001). Figures showing structures and alignments were 
prepared with UCSF Chimera (Pettersen et al., 2004) and with 
SeaView (Gouy et al., 2010) and ESPript/ENDscript (Robert & 
Gouet, 2014), respectively. The X-ray structure was validated 
and deposited in the Protein Data Bank with accession code 
6¢13. Data-collection and refinement statistics are shown in 
Table 1. 


2.4. Small-angle X-ray scattering (SAXS) 


SAXS measurements were recorded on beamline BM29 at 
the ESRF using a PILATUS 1M detector at a sample-to- 
detector distance of 2.43 m and a wavelength of 0.09919 nm 
(12.51 keV). SAXS intensity was measured in the range 
0.025-5 nm’. The scattering vector s is defined here as s = 
(47r/X.)sin@, where A is the wavelength of the incident radiation 
in nanometres and @ is half of the angle between the incident 
and scattered radiation. Measurements were obtained at seven 
protein concentrations in the range 0.14-8.51 mg ml’. The 
protein buffer consisted of 10 mM HEPES, 300 mM NaCl pH 
7.5. All samples were centrifuged for 15 min at 15 000g prior 
to the experiment to minimize the contribution from aggre- 
gated particles. Measurements were made for each protein 
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Figure 1 


MALLS/UV/refractometry/SEC analysis. The right y axis represents the 
OD at 280 nm and the left y axis represents the molar mass in kDa. The 
absorption peaks are shown as a continuous black line; the blue dots 
indicate the molar mass. 


10 Nguyen et al. 


- C-terminal region of the MERS-CoV nucleoprotein 


Table 1 


Data processing, structure solution and refinement. 


Values in parentheses are for the highest resolution shell. 


Data processing 


Wavelength (A) 0.9787 

Space group P3,21 

a, b, c (A) 120.7, 120.7, 92.6 
a, B, y (°) 90.0, 90.0, 120.0 


Resolution range (A) 
Total No. of reflections 
No. of unique reflections 


69.33-1.97 (2.04-1.97) 
110454 (10949) 
55298 (5473) 


Completeness (%) 100 (99.96) 
Multiplicity 2.0 (2.0) 

(I/a(1)) 15.91 (2.47) 
Resa 0.03473 (0.4165) 
CCi 2 0.998 (0.91) 
Wilson B factor (A?) 43.9 


Structure solution and refinement 


No. of reflections, working set 55294 (5475) 


No. of reflections, test set (5.27%) 2912 (303) 
Reryst 0.19 
Riree 0.22 
No. of non-H atoms 
Total 3970 
Protein 3632 
Ligand 28 
Water 310 
R.m.s. deviations 
Bonds (A) 0.013 
Angles (°) P 1.64 
Average B factors (A’) 
Overall 64.97 
Protein 64.76 
Ligand 74.70 
Water 66.57 
Ramachandran plot 
Favored (%) 98.17 
Allowed (%) 1.83 
Outliers (%) 0 
Clashscore 4.43 


concentration at 4°C by injecting 45 pl protein sample into a 
1.8 mm capillary with flow to minimize radiation damage. 

Each measurement consisted of ten exposures of 1 s, which 
were subsequently averaged into a unique intensity profile for 
each measurement. Buffer reference SAXS measurements 
were performed before and after the measurement for each 
protein sample. The forward scattering intensity was calli- 
brated using BSA as a reference at 5 mg ml’. 

Data were processed with the ATSAS package (Petoukhov 
& Svergun, 2007). The ten frames of 1 s exposure were aver- 
aged using PRIMUS (Konarev et al., 2003). Frames affected by 
radiation damage were excluded, while the buffer background 
was subtracted from the sample-scattered intensity signal. 


3. Results and discussion 
3.1. MERS-CoV CTD behaves as a dimer in solution 


Characterization of the purified CTD protein using ana- 
lytical SEC with online multi-angle laser light scattering (SEC- 
MALLS) shows that the protein is mainly organized in 
particles of 29 kDa (elution peak at 20.5 min) corresponding 
to the molecular weight of a CID dimer. A second minor 
population (elution peak at 22.5 min) that was only observed 
in the analytical SEC and that also displays an apparent 
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molecular weight (MW) of 29 kDa probably corresponds to 
the CTD interacting with the silica matrix. Thus, recombinant 
MERS-CoV CTD is organized in dimers in solution (Fig. 1); 
this result is consistent with previous observations and with 
the proposal that N dimerizes through its CTD, forming the 
core unit leading to assembly of the N polymer (Yu et al. , 2006; 
Jayaram et al., 2006). 

To further characterize the oligomerization state of the 
protein in solution, we studied the protein using SAXS. 
Fig. 2(a) shows the Guinier region sR, = 0.1—1.29 considering a 
globular type of protein, showing that no trend towards 
aggregation is present with increasing concentration. Data are 
shown as open circles and linear regression of data is shown as 
red lines. For all concentrations we found the same value of 


Ry =2.0+0.3 nm 


0.1 0.2 0.3 0.4 0.5 0.6 
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(a) 
1 
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0 
0 1 2 a 4 5 6 
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Figure 2 


radius of gyration R,, with a mean value (taken over all seven 
concentrations) of R, = 2.0 + 0.3 nm. Mass estimation using 
the SAXS profile of a standard BSA molecule returns a 
molecular weight (MW) of 29.32 kDa, which is in agreement 
with the previous SEC/MALLS measurements. This value is 
compatible with the MW of a dimer, knowing that the calcu- 
lated MW of the monomer based on the protein sequence is 
14.22 kDa. Again, this result is consistent with previous 
observations (Yu et al., 2006; Jayaram et al., 2006) and shows 
that this dimer is not transient but rather stable as it is 
not significantly affected by low or high protein concen- 
trations. 

In order to interpret our data and propose a molecular 
model, we used a merged data profile truncated between the 


4 
3] R, = 2.0+0.2nm 

Drax = 6-49 nM 
2 


In(/) 
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SAXS analysis of the MERS-CoV CTD. (a) Guinier plot for all concentrations showing the absence of aggregation. (b) GNOM fit of the merged profile. 
(c) Distance distribution P(r) of the merged data. (d) CRYSOL fit of the model (PDB entry 6213) with the experimental data (blue line) and CRYSOL fit 
of an enhanced model featuring the disordered N-terminal region residues modeled by EOM (red line). The three corresponding enhanced models 


calculated by EOM are shown also in the inset in (d). 
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recommended typical values of the wavevector Sin = 7/Dmax 
(0.5 nm™*) and Sax = 7/R, (3.5 nm~*) (Svergun, 1992). 

In Fig. 2(b) we show the output of the GNOM numerical fit 
over the merged profile, while Fig. 2(c) shows the corre- 
sponding distance distribution P(r). The latter smoothly 
converges to zero for a maximum particle dimension of Dax = 
6.49 nm. The measured dimer dimension of the crystallo- 
graphic structure that we determined (PDB entry 6g13) is 
smaller at 5.6 nm. However, ten residues in the N-terminal 
region of the monomer were not resolved in the crystallo- 
graphic structure (Thr237—Lys247). These residues contribute 
to the extra overall length of the molecule. To verify this 
hypothesis, we modeled these regions using LOM analysis 
(Ensemble Optimization Method; Bernardo et al., 2007). Using 
Coot, the ten missing residues were added to the globular part 
of the molecule and were modeled by KOM keeping the 
globular part fixed. The KOM calculations resulted in three 
possible molecular conformations. CRYSOL fits the measured 
scattering curve with a simulated scattering curve generated 
using a known X-ray crystal structure. In Fig. 2(d) we show 
two CRYSOL fits using the two PDB model structures with 
and without additional amino acids at the N-terminus. More 
precisely, the CRYSOL fit shown as a blue continuous line 
corresponds to the crystallographic dimer structure (PDB 
entry 6g13) with no addition of any supplementary amino 
acids. The red continuous line is the CRYSOL fit corre- 
sponding to a mixture of the three molecular conformations 
calculated by EOM after the addition of the missing amino 
acids to the N-terminal region of each monomer. At the 
bottom of the figure, the difference between the data and the 
CRYSOL fit is shown. The blue difference spectrum corre- 
sponds to the CRYSOL fit of PDB entry 6g13, while the red 
difference spectrum corresponds to the CRYSOL fit of the 


Figure 3 


structure calculated by KOM. We observe that in the higher 
resolution range (between 2 and 3.5 nm‘), consideration of 
the enhanced model calculated by KOM improves the fit. This 
result suggests that the core domain is a dimer in solution, for 
which a certain flexibility is expected at the N-terminus. 


3.2. Structure of the MERS-CoV CTD 


A dimer of the MERS-CoV CTD was crystallized. Crystals 
of an apparent cubic shape with a side length of 150 um were 
obtained after 3-5 days. The MERS-CoV CTD crystallized in 
space group P3,21. The asymmetric unit was constituted of 
two identical dimers rotated by 90° around two axes, with the 
second dimer being in the center of the first (Fig. 3a). The 
structure was refined to 1.97A resolution. The dimer is 
formed by two molecules related by a noncrystallographic 
twofold axis and overall looks like a rectangular slab (Fig. 3D). 
An a-helix bundle is on one side of the slab, while the other 
side is formed by the flat surface of a four-stranded /-sheet 
flanked by two flexible loops. Each monomeric subunit has an 
(a+) fold constituted of five short a-helices and four turns (7) 
forming a semi-ring, from which two antiparallel 6-strands 
protrude (Fig. 3c). It is interesting to note that several residues 
which are not in contact with adjacent protomers present a 
high degree of flexibility, presumably leading to poor electron 
density. Indeed, in the final refined structure we were unable 
to build several residues of the N-terminus (Thr237—Lys248) 
or of the loop between the two strands (Asp320-—Gly322). This 
is consistent with the SAXS analysis (see Section 3.1) and 
shows that the crystallographic structure corresponds to a 
biological assembly behaving as in solution. 

The analysis of the hydrophobic surface of a monomer 
(Fig. 4) shows the presence of three hydrophobic patches that 


The MERS-CoV CTD structure in different orientations. (a) Asymmetric unit illustrating the two dimers at 90° rotation about two axes (dimer 1 is 
shown in green and red, and dimer 2 is shown in yellow green and blue). (6) Clipped monomers shown in ribbon representations and in different 
orientations, showing how the two chains (green and cream) are intertwined. (c) Ribbon representation of the MERS-CoV CTD structure. Helices are 
colored magenta, strands are colored green and random coils are colored cream; secondary structures are labeled. 
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interact with the secondary-structure elements of the other 
protomer. The first patch on the top interacts with helix a5 
(Fig. 4), the second patch stacks the core of the two f-strands 
(Fig. 4) and finally the last patch opposite to the second patch 
interacts with the N-terminal pseudo-turn 71 (Fig. 4). We can 
infer from this analysis that the driving force of the dimer- 
ization is hydrophobic, giving the overall idea of a ‘clipping’ 
mechanism. The core secondary structures of each protomer 
are tightly intertwined, making this dimer a cornerstone 
element in the assembly of the ribonucleoprotein, the overall 
structure of which still needs to be defined. Based on the 
recent EM analysis (Gui et al., 2017) and prior crystallographic 
studies on homologous domains (Yu et al., 2006; Jayaram et al., 
2006; Ma et al., 2010), this CTD dimer would be the building 
block of the assembly of a polymeric ribonucleoprotein. 
Indeed, this assembly is compatible with a head-to-head 
subunit leading to a helical filament. In this proposed model 
the growth of the filament is sideways (Ma et al., 2010). 

The unique asymmetric unit arrangement observed in the 
MERS-CoV CTD (Fig. 3a), in which the middle of the a-helix 
bundle of one dimer can interact with the side of the second 
dimer, suggests that a dimer could prevent the addition of an 
another ‘sideways dimer’. In such a scenario one dimer could 
act as a ‘cap’ to prevent further polymerization or regulate the 
erowth of the filament (Supplementary Fig. $1). This type of 


g0° 


Figure 4 


regulation is reminiscent of the growth control of actin fila- 
ments by steric hindrance at one end (Shekhar et al., 2016). 


3.3. MERS-CoV CTD sequence and structure conservation 


Sequence comparison against the PDB shows that the 
MERS-CoV CTD has 23-46% identity to homologous 
domains, including those from IBV, HCoV-NL63 and SARS- 
CoV. The overall alignment (Supplementary Fig. S2a) shows 
8.9% identity, highlighting the key residues that are 
conserved. Considering their positions in the structure, either 
buried or in loops, we infer that these residues play a structural 
role to mainly ensure proper folding and/or dimerization 
(Supplementary Fig. S2b). Superposition of the corresponding 
structures (Supplementary Fig. S3) with the MERS-CoV CTD 
shows that the folds are virtually identical, with the exception 
of the flexible ends and the insertion into the 6 loop. 


3.4. MERS-CoV CTD surface electrostatic conservation 


In spite of low overall sequence conservation, the proper- 
ties of the residues are fairly conserved, with an overall 
similarity of ~60%. Analysis of the electrostatic surface 
highlights these similar characteristics between the MERS- 
CoV, SARS-CoV and IBV CTDs, and almost an inversion of 
the charges in the HCoV-NL63 CTD. Indeed, in the first three 


150° 


150° 


Hydrophobicity analysis of MERS-CoV CTD. (a) Surface hydrophobicity showing the hydrophobic potential of a monomer represented by a gradient 
colored from magenta (high) to cyan (low). (b) Surface representation of the MERS-CoV CTD surface hydrophobicity with the second chain of the 
dimer. 
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Orientation MERS IBV SARS HCoV-N63 
IBV PDB entry 6913 PDB entry 2ca1 PDB entry 2jw8 PDB entry 5epw 
PDB entry 2ca‘1 
Figure 5 


MERS-CoV CTD surface electrostatic comparison with homologous viral domains. The structures presented are from IBV (PDB entry 2ca1; Jayaram et 
al., 2006), HCoV-NL63 (PDB entry Sepw; Szelazek et al., 2017), SARS-CoV (PDB entry 2cjr; Chen et al., 2007) and MERS-CoV (PDB entry 6¢13). The 
surface electrostatic potential calculated by APBS is shown from —7kT/e (red) to +7kT/e (blue). 


dimers the flat side is mostly neutral, while in HCoV-NL63 the 
charge is negative (Fig. 5, bottom row). On the a-bundle side, 
a strong positively charged patch is also observed bordering 
the top and bottom of the dimer, while for the HCoV-63 CTD 
the bottom and a-bundle sides are either mostly neutral, with 
a small positive patch in the center of the dimer (Fig. 5, top 
row). These differences are surprising considering that the 
sequence of the HCoV-NL63 CTD is not the most distant from 
that of the MERS-CoV CTD (Supplementary Fig. S2c). 
However, functional studies underscore the role of the CTD in 
the recognition of the packaging signal in coronaviruses such 
as Mouse hepatitis virus (Kuo et al., 2014). In particular, 
MERS-CoV CTD 1s able to bind a 45-nucleotide stem-loop 
structure of the packaging signal sequence (Hsin et al., 2018). 
This observation is consistent with previous observations on 
SARS-CoV N, where the C-terminal domain alone is able to 
bind RNA but would rather act in cooperation with other 
RNA-binding sites along N for formation of the ribonucleo- 
capsid (Chang et al., 2009). In contrast, the CTD of HCoV- 
NL63, which lacks the positively charged area observed in 
MERS and SARS-CoV CTD, is not, or is at least poorly, able 
to bind RNA (Zuwala et al., 2015). Therefore, the structural 
information described in this study may help to reveal the 
residues involved in the specific recognition of the packaging 
signal or possible differences in the mechanisms of RNP 
assembly. 

Aside from its structural role in RNA packaging, MERS- 
CoV N could also be involved in disease severity by up- 
regulating the cytokine CXCL 10 (Aboagye et al., 2018). The 
C-terminal part of N (amino acids 196-413) is sufficient to 
overexpress CXCL10 and the differences between the charge 
organization on the CTD in viruses leading to severe 
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syndromes (SARS-CoV and MERS-CoV) and those leading 
to mild symptoms (HCoV-NL63) could also be investigated in 
light of the infection outcomes. 

Overall, the structural determination of the MERS-CoV 
CTD reported here allows it to be proposed that the MERS- 
CoV CTD plays a key role in the formation of the RNP ina 
similar fashion to other CTDs, and that the CTD might also 
recruit specific RNA sequences by a mechanism that remains 
to be elucidated. 
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